Inducing Search Keys for Name Filtering

نویسنده

  • Karl Branting
چکیده

This paper describes ETK (Ensemble of Transformation based Keys) a new algorithm for inducing search keys for name filtering. ETK has the low computational cost and ability to filter by phonetic similarity characteristic of phonetic keys but is adaptable to alternative similarity models. A preliminary empirical evaluation suggests that ETK may be well-suited for phonetic filtering applications such as recognizing alternative cross-lingual transliterations.1

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic key discovery for Data Linking

In the recent years, the Web of Data has increased significantly, containing a huge number of RDF triples. Integrating data described in different RDF datasets and creating semantic links among them, has become one of the most important goals of RDF applications. These links express semantic correspondences between ontology entities or data. Among the different kinds of semantic links that can ...

متن کامل

Generating and Using Name Keys for Fuzzy Matches: Calling a Third Party Dynamic Link Library as a Module in the SAS® System

Identifying duplicate customer records or tying together customer records from different sources is done most efficiently if names can be "fuzzy" matched against each other. While SAS implements the SOUNDEX algorithm it also allows use of external DLLs via the CALL statement, so specialist third party routines can be used, such as Search Software America’s (SSA) specialist search key technology...

متن کامل

Unsupervised Chinese Personal Name Recognition Using Search Session

Personal name recognition is an important part of named entity recognition in Web search query logs. An unsupervised method for Chinese personal name recognition in queries is proposed using search session. Based on seed personal names which are produced automatically by introducing Chinese surnames, a local expansion method is proposed by using search sessions in query logs;and by modeling the...

متن کامل

Using Interactive Search Elements in Digital Libraries

Background and Aim: Interaction in a digital library help users locating and accessing information and also assist them in creating knowledge, better perception, problem solving and recognition of dimension of resources. This paper tries to identify and introduce the components and elements that are used in interaction between user and system in search and retrieval of information in digital li...

متن کامل

Indexing Methods for Faster and More Effective Person Name Search

This paper compares several indexing methods for person names extracted from text, developed for an information retrieval system with requirements for fast approximate matching of noisy and multicultural Romanized names. Such matching algorithms are computationally expensive and unacceptably slow when used without an indexing or blocking step. The goal is to create a small candidate pool contai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007